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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Greenspan , Daniel S 
Takahara, Kazuhiko 
Hoffman, Guy G 

10 (ii) TITLE OF INVENTION: Mammalian Tolloid-Like Protein 

(iii) NUMBER OF SEQUENCES : 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Quarles & Brady 

15 (B) STREET: 1 South Pinckney Street 

(C) CITY: Madison 

(D) STATE: WI 

(E) COUNTRY: US 

(F) ZIP: 53703 

JgO (v) COMPUTER READABLE FORM: 
^ (A) MEDIUM TYPE: Floppy disk 

m (B) COMPUTER: IBM PC compatible 

Cl <C) OPERATING SYSTEM: PC-DOS/MS-DOS 

f* (D) SOFTWARE: Patentln Release #1.0, 

Jf5 Version #1.30 

W (vi) CURRENT APPLICATION DATA: 
01 (A) APPLICATION NUMBER: 

Qi <B) FILING DATE: 

J (C) CLASSIFICATION: 

§§0 (viii) ATTORNEY /AGENT INFORMATION: 

S (A) NAME: Berson, Bennett U 

Hf (B) REGISTRATION NUMBER: 3 7094 

\£ (C) REFERENCE/DOCKET NUMBER : 960296.93839 

O (ix) TELECOMMUNICATION INFORMATION: 

$B (A) TELEPHONE: 608-251-5000 

<B) TELEFAX: 608-251-9166 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 0 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide probe" 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ACGTCCAGAC CGGAGCGGGT GTGGCCCGAT GGGGTCATCC CGTTTGTGAT TGGAGGGAAT 60 
TTCACAGGCA GCCAGAGGGC AGTCTTCCGG CAGGCCATGA GACACTGGGA GAAGCATACC 120 
TGTGTCACCT TCTTGGAGCG CACAGATGAG GACAGCTATA TTGTATTCAC CTACCGACCC 180 
TGCGGGTGCT GCTCCTACGT GGGTCGCCGA GGTGGGGGCC CCCAGGCCAT CTCCATCGGC 240 

50 AAGAACTGTG ACAAGTTTGG CATCGTGGTC CATGAGCTGG GCCATGTCAT TGGCTTCTGG 300 

CACGAGCACA CGCGGCCCGA CCGCGACCGC 33 0 
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<2) INFORMATION FOR SBQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 611.. 3652 

(D) OTHER INFORMATION: /product^ "murine mTll protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CACACCCCTT TGCTCTCCGG GCAGTCGGGA GCTTCCCTAG CTTCGGCAGG CTTTTAAGGT 60 

CTGGCGGCGT AGAAATGCCT ATCCCCCACC CCCTTCCTCG GTCTCCCCTT TCAGTTCAGA 120 

TGTGCTGATG TGCAGACCGG ATTCATCTTC CCCGAGCAGC GGCGGTGGCA GCGGCGGGCG 180 

CAGGCGGCTG CAGCTCGCTC TCGGCCGCGG GGTCCTGACA GCGGCGGGGG CGCGGCGCGG 240 

GAGCCGGAGC TCCGGTGGCA GCTGAGCCCG CCGTGCGCCT CTCGCCGCGG CCGGTCGTGA 300 

TCGCGGGAAG TTCGACCGCT GGAAGGACGA CCTAGACCGA GCCGGGTTGG CTGCGGCTGC 3 60 

CCTGCGCCGA GCTCCTCACC TGCCTTCCGC CCACCCGCGG GCCCCCGGCC AAGTTCCCCA 420 

GCATCCGGGG GAGACAGGGA GACATTTGCC CTCTCTAGCG TCCTGAAGAC ATCCGCATGT 480 

CTCCGGACAC CTGAACATTC AGGTCTTTCC GAGGAGCTTC CCAGTCGGGA TAAGAACACT 540 

GTCCCTAGAG CCCCGCATAT CCACGCGGCC CTCCGGGTCT GGTCCCCTCC TTTTCCTCTA 6 00 

GGGGAGGAGG ATG GGT TTG CAA GCG CTC TCC CCG AGG ATG CTC CTG TGG 649 
Met Gly Leu Gin Ala Leu Ser Pro Arg Met Leu Leu Trp 
15 10 

TTG GTG GTC TCG GGT ATT GTT TTC TCC CGG GTG CTG TGG GTC TGC GCT 697 
Leu Val Val Ser Gly lie Val Phe Ser Arg Val Leu Trp Val Cys Ala 
15 20 25 

GGC ' CTC GAT TAT GAT TAC ACT TTT GAT GGG AAC GAA GAG GAC AAA ACG 74 5 
Gly Leu Asp Tyr Asp Tyr Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr 
30 35 40 45 

GAG CCT ATA GAT TAC AAG GAC CCG TGC AAA GCT GCT GTG TTT TGG GGT 793 
Glu Pro lie Asp Tyr Lys Asp Pro Cys Lys Ala Ala Val Phe Trp Gly 
50 55 60 

GAC ATC GCC TTA GAT GAT GAA GAC TTA AAT ATC TTC CAA ATA GAC AGG 841 
Asp lie Ala Leu Asp Asp Glu Asp Leu Asn lie Phe Gin lie Asp Arg 
65 70 75 

ACA ATT GAC CTG ACC CAG AGC CCC TTT GGA AAA CTT GGA CAT ATT ACA 889 
Thr lie Asp Leu Thr Gin Ser Pro Phe Gly Lys Leu Gly His lie Thr 
80 85 90 

GGT GGC TTT GGA GAC CAT GGC ATG CCA AAG AAG CGA GGG GCA CTC TAC 93 7 
Gly Gly Phe Gly Asp His Gly Met Pro Lys Lys Arg Gly Ala Leu Tyr 
95 100 105 



CAA CTT ATA GAG AGG ATC AGA AGA ATT GGC TCT GGC TTG GAG CAA AAT 985 
Gin Leu He Glu Arg He Arg Arg He Gly Ser Gly Leu Glu Gin Asn 
110 115 120 125 

AAC ACG ATG AAG GGA AAA GCA CCT CCA AAA TTG TCA GAG CAA AGT GAG 1033 
5 Asn Thr Met Lys Gly Lys Ala Pro Pro Lys Leu Ser Glu Gin Ser Glu 

130 135 140 

AAA AAT CGA GTT CCC AGA GCT GCT ACC TCA AGA ACG GAA AGG ATA TGG 1081 
Lys Asn Arg Val Pro Arg Ala Ala Thr Ser Arg Thr Glu Arg He Trp 
145 150 155 

10 CCT GGG GGT GTC ATT CCT TAT GTC ATA GGA GGA AAC TTT ACT GGC AGC 1129 

Pro Gly Gly Val He Pro Tyr Val He Gly Gly Asn Phe Thr Gly Ser 
160 165 170 

CAG AGA GCC ATG TTC AAG CAG GCC ATG AGA CAC TGG GAA AAG CAC ACC 1177 
Gin Arg Ala Met Phe Lys Gin Ala Met Arg His Trp Glu Lys His Thr 
15 175 180 185 

TGT GTG ACG TTC ACT GAG AGA AGT GAT GAA GAA AGT TAT ATT GTG TTC 1225 
□ Cys Val Thr Phe Thr Glu Arg Ser Asp Glu Glu Ser Tyr He Val Phe 

P 190 195 200 205 

% ACC TAC AGG CCT TGT GGA TGC TGC TCC TAT GTT GGT CGG CGG GGA AAT 1273 

420 Thr Tyr Arg Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn 

210 215 220 



^ GGC CCT CAG GCC ATC TCT ATT CGC? AAG AAC TGT GAC AAG TTT GGA ATT 1321 

Gly Pro Gin Ala He Ser He Gly Lys Asn Cys Asp Lys Phe Gly He 

yi 225 230 235 

35 GTT GTT CAT GAA CTG GGC CAC GTG ATA GGC TTC TGG CAT GAA CAT ACC 1369 

Val Val His Glu Leu Gly His Val He Gly Phe Trp His Glu His Thr 

21 240 245 250 

0 OGC CCA GAC CGA GAC AAC CAT GTC ACC ATC ATT AGA GAG AAC ATC CAG 1417 

f3 Arg Pro Asp Arg Asp Asn His Val Thr He He Arg Glu Asn lie Gin 

3[0 255 260 265 

CCA GGT CAA GAG TAC AAT TTT CTA AAG ATG GAG CCT GGA GAA GTG AAC 1465 
Pro Gly Gin Glu Tyr Asn Phe Leu Lys Met Glu Pro Gly Glu Val Asn 
270 275 280 285 

TCT CTT GGG GAA AGA TAT GAT TTT GAC AGT ATC ATG CAC TAC GCC AGG 1513 
3 5 Ser Leu Gly Glu Arg Tyr Asp Phe Asp Ser He Met His Tyr Ala Arg 

290 295 300 

AAC ACC TTC TCA AGA GGG ATG TTT TTA GAC ACA ATA CTC CCC TCC CGT 1561 
Asn Thr Phe Ser Arg Gly Met Phe Leu Asp Thr He Leu Pro Ser Arg 
305 310 315 

40 GAT GAT AAT GGC ATT CGT CCT GCA ATT GGT CAA CGG ACC CGG TTA AGC 1609 

Asp Asp Asn Gly He Arg Pro Ala He Gly Gin Arg Thr Arg Leu Ser 
320 325 330 

AAA GGA GAC ATT GCA CAA GCA AGA AAG CTG TAT CGA TGC CCA GCA TGT 1657 
Lys Gly Asp lie Ala Gin Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys 
45 335 340 345 

GGA GAA ACC CTG CAA GAA TCC AGT GGC AAC CTT TCT TCC CCA GGA TTC 17 05 
Gly Glu Thr Leu Gin Glu Ser Ser Gly Asn Leu Ser Ser Pro Gly Phe 
350 355 360 365 

CCA AAT GGC TAC CCT TCC TAC ACA CAC TGC ATC TGG AGA GTG TCT GTG 1753 
50 Pro Asn Gly Tyr Pro Ser Tyr Thr His Cys He Trp Arg Val Ser Val 

370 375 380 
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ACC CCG GGA GAA AAG ATT GTC TTG AAT TTT ACC AC A ATG GAC CTT TAC 1801 
Thr Pro Gly Glu Lys He Val Leu Asn Phe Thr Thr Met Asp Leu Tyr 
385 390 395 

AAA AGT AGT TTG TGC TGG TAT GAT TAC ATT GAA GTA AGA GAT GGT TAC 184 9 
5 Lys Ser Ser Leu Cys Trp Tyr Asp Tyr He Glu Val Arg Asp Gly Tyr 

400 405 410 

TGG AGG AAG TCA CCT CTC CTT GGT AGA TTC TGT GGG GAC AAA GTG GCT 1897 
Trp Arg Lys Ser Pro Leu Leu Gly Arg Phe Cys Gly Asp Lys Val Ala 
415 420 425 

10 GGA GTT CTT ACA TCT ACG GAC AGC AGA ATG TGG ATT GAG TTT CGT AGC 1945 

Gly Val Leu Thr Ser Thr Asp Ser Arg Met Trp He Glu Phe Arg Ser 
430 435 440 445 

AGC AGT AAC TGG GTA GGA AAA GGG TTT GCA GCT GTC TAT GAA GCG ATT 1993 
Ser Ser Asn Trp Val Gly Lys Gly Phe Ala Ala Val Tyr Glu Ala He 
15 450 455 460 

TGT GGA GGG GAG ATA AGG AAA AAC GAA GGG CAG ATT CAG TCT CCC AAT 2041 
O Cys Gly Gly Glu He Arg Lys Asn Glu Gly Gin He Gin Ser Pro Asn 

y§ 465 470 475 

TAC CCC GAT GAC TAC CGA CCA ATG AAG GAG TGT GTA TGG AAA ATA ATG 2089 
4fi0 Tyr Pro Asp Asp Tyr Arg Pro Met Lys Glu Cys Val Trp Lys He Met 

PJ 480 485 490 

KS GTG TCC GAG GGC TAC CAT GTT GGA CTG ACC TTT CAG GCC TTT GAG ATC 213 7 

Val Ser Glu Gly Tyr His Val Gly Leu Thr Phe Gin Ala Phe Glu lie 
^= 495 500 505 

ffsS GAA AGA CAT GAC AGC TGT GCC TAT GAC CAC CTA GAA GTT CGA GAT GGA 2185 

iJJ Glu Arg His Asp Ser Cys Ala Tyr Asp His Leu Glu Val Arg Asp Gly 

*H 510 515 520 525 

03 GCC AGT GAG AAC AGC CCT TTG ATA GGA CGG TTC TGT GGT TAT GAC AAA 223 3 
Ala Ser Glu Asn Ser Pro Leu He Gly Arg Phe Cys Gly Tyr Asp Lys 

3t) 530 535 540 

CCT GAA GAT ATA AGG TCT ACT TCC AAC ACC CTG TGG ATG AAG TTT GTC 2281 
Pro Glu Asp He Arg Ser Thr Ser Asn Thr Leu Trp Met Lys Phe Val 
545 550 555 

TCT GAC GGG ACT GTG AAC AAG GCA GGG TTT GCT GCG AAC TTT TTT AAA 2329 
35 Ser Asp Gly Thr Val Asn Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys 

560 565 570 

GAG GAA GAT GAG TGT GCC AAA CCT GAC CGA GGA GGC TGT GAA CAG AGG 2377 
Glu Glu Asp Glu Cys Ala Lys Pro Asp Arg Gly Gly Cys Glu Gin Arg 
575 580 585 

4 0 TGT CTT AAC ACA CTA GGC AGC TAC CAG TGT GCC TGT GAG CCT GGC TAT 2425 

Cys Leu Asn Thr Leu Gly Ser Tyr Gin Cys Ala Cys Glu Pro Gly Tyr 
590 595 600 605 

GAA CTG GGG CCA GAC AGA AGA AGC TGT GAA GCT GCT TGC GGA GGA CTT 2473 
Glu Leu Gly Pro Asp Arg Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu 
45 610 615 620 

CTG ACG AAG CTC AAT GGC ACC ATA ACC ACC CCC GGC TGG CCC AAA GAG 2521 
Leu Thr Lys Leu Asn Gly Thr He Thr Thr Pro Gly Trp Pro Lys Glu 

625 630 635 

TAC CCT CCA AAC AAA AAC TGT GTG TGG CAA GTG ATC GCG CCA AGC CAG 2 569 
50 Tyr Pro Pro Asn Lys Asn Cys Val Trp Gin Val He Ala Pro Ser Gin 

640 645 650 
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TAC AGA ATC TCT GTG AAG TTT GAG TTT TTT GAA TTG GAA GGC AAT GAA 2617 
Tyr Arg lie Ser Val Lys Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu 
655 660 665 

GTT TGC AAA TAC GAT TAC GTG GAG ATC TGG AGC GGC CCT TCC TCT GAG 2665 
5 Val Cys Lys Tyr Asp Tyr Val Glu He Trp Ser Gly Pro Ser Ser Glu 

670 675 680 685 

TCT AAA CTG CAT GGC AAG TTC TGT GGC GCT GAC ATA CCT GAA GTG ATG 2713 
Ser Lys Leu His Gly Lys Phe Cys Gly Ala Asp He Pro Glu Val Met 
690 695 700 

10 ACT TCC CAT TTC AAC AAC ATG AGG ATT GAA TTC AAG TCA GAC AAC ACT 2761 

Thr Ser His Phe Asn Asn Met Arg He Glu Phe Lys Ser Asp Asn Thr 
705 710 715 

GTA TCC AAG AAG GGC TTC AAA GCA CAT TTT TTC TCA GAT AAG GAT GAG 2809 
Val Ser Lys Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu 
15 720 725 730 

TGT TCA AAG GAT AAT GGT GGC TGT CAG CAT GAG TGT GTC AAC ACG ATG 2857 
h:? Cys Ser Lys Asp Asn Gly Gly Cys Gin His Glu Cys Val Asn Thr Met 

0 735 740 745 

Jg GGA AGT TAC ACG TGT CAG TGC CGG AAT GGA TTC GTG TTG CAT GAG AAC 2 905 

f 5 |20 Gly Ser Tyr Thr Cys Gin Cys Arg Asn Gly Phe Val Leu His Glu Asn 

750 755 760 765 

P AAG CAT GAT TGC AAG GAA GCC GAG TGT GAA CAG AAG ATA CAC AGC CCA 2953 

01 I>ys His Asp Cys Lys Glu Ala Glu Cys Glu Gin Lys He His Ser Pro 
T 770 775 780 

M25 AGT GGT CTC ATC ACC AGT CCC AAC TGG CCA GAC AAG TAT CCA AGC AGG 3001 

|J3 Ser Gly Leu He Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg 

fil 785 790 795 

J! AAA GAG TGC ACG TGG GTG ATC AGT GCC ATT CCT GGC CAC CGC ATC ACA 3 049 

W Lys Glu Cys Thr Trp Val He Ser Ala He Pro Gly His Arg He Thr 

HBO 800 805 810 

TTA GCC TTC AAT GAG TTT GAG GTT GAA CAA CAT CAA GAA TGT GCT TAT 3 097 
Leu Ala Phe Asn Glu Phe Glu Val Glu Gin His Gin Glu Cys Ala Tyr 
815 820 825 

GAT CAC TTG GAA ATT TTT GAT GGA GAA ACG GAG AAG TCA CCA ATA CTT 3145 
35 Asp His Leu Glu He Phe Asp Gly Glu Thr Glu Lys Ser Pro He Leu 

830 835 840 845 

GGC CGA CTA TGT GGC AGC AAG ATA CCA GAT CCC CTC ATG GCT ACT GGG 3193 
Gly Arg Leu Cys Gly Ser Lys He Pro Asp Pro Leu Met Ala Thr Gly 
850 855 860 

40 AAT GAA ATG TTT ATT CGG TTT ATT TCT GAT GCC TCT GTT CAA AGA AAA 3241 

Asn Glu Met Phe He Arg Phe He Ser Asp Ala Ser Val Gin Arg Lys 
865 870 875 

GGC TTT CAA GCT ACA CAT TCC ACA GAG TGT GGT GGT CGA TTG AAA GCA 3289 
Gly Phe Gin Ala Thr His Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala 
45 880 885 890 

GAG TCA AAG CCT AGA GAC CTG TAC TCC CAT GCT CAG TTT GGT GAT AAT 3 337 
Glu Ser Lys Pro Arg Asp Leu Tyr Ser His Ala Gin Phe Gly Asp Asn 
895 900 905 

AAC TAC CCA GGA CAA CTG GAC TGT GAA TGG TTG TTG GTG TCA GAA CGA 3385 
50 Asn Tyr Pro Gly Gin Leu Asp Cys Glu Trp Leu Leu Val Ser Glu Arg 

310 915 920 925 



-28- 



GGA TCT CGA CTT GAA TTG TCC TTC CAG ACA TTC GAA GTA GAA GAA GAA 3433 
Gly Ser Arg Leu Glu Leu Ser Phe Gin Thr Phe Glu Val Glu Glu Glu 
930 935 940 

GCT GAC TGT GGC TAT GAC TAT GTT GAA GTC TTT GAT GGT CTC AGT TCA 34 81 
Ala Asp Cys Gly Tyr Asp Tyr Val Glu Val Phe Asp Gly Leu Ser Ser 
945 950 955 

AAA GCT GTG GGT CTT GGT AGA TTC TGT GGG TCA GGG CCA CCA GAA GAA 3529 
Lys Ala Val Gly Leu Gly Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu 
960 965 970 

ATC TAT TCA ATT GGA GAT GTG GCT TTG ATT CAT TTC CAC ACA GAT GAC 3 577 
lie Tyr Ser lie Gly Asp Val Ala Leu He His Phe His Thr Asp Asp 
975 980 985 

ACT ATC AAC AAG AAA GGA TTT TAT ATA AGA TAT AAA AGT ATA AGA TAC 3625 
Thr He Asn Lys Lys Gly Phe Tyr He Arg Tyr Lys Ser He Arg Tyr 
990 995 1000 1005 

CCG GAA ACG ATG CAT GCC AAG AAC TAA TGCCGACCCT CCCTCAGAAC 3672 
Pro Glu Thr Met His Ala Lys Asn * 
1010 

AGAAAGGAAT GTGCATATGG AAAGAAGACA TTTTTAAAAT AGACAATATT AATACAATTG 3732 
TTTTATATAA TGAATTTGAG CAAAAGAAAC CTGCAAGATT AGAGTTATCT CTGAAGTGAA 3792 
AGAGAACTTT CCAGAAAACC TGATTGGCAT TGCAAGGATG TTTGAAAGTC ATGCTTGTTC 3852 
AAGGAAGATT AACAGCTTGA AATAGATGCT TCACATTTTG GACAGTTCAT TTAATGAGCT 3912 
GTGATTCTCT GGAGTGATTT CTTGACTACT TTTCCAAGAT CTGGGGACGT AGAAATGATG 3972 
GGACGGATCA TAGCTTGGAA ACCCACTTCT TGGGTCTTAG CATGTTTGCT TAGACTCTGT 4032 
AGGTCAGACA CAGTGTAAAC CAAATTCATG TAAGGTGATG TGGAATAGTG GTCTTTGGAA 4092 
GGTGGTTCAT CATTTAAATG TAGGTTTGTG CTTGTGTGTA TGTTCACATA TGCAAGTGTG 4152 
TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGCGTG TGTGTGTGTG TGCGTGTGTG 4212 
TGTGTGTGCA TGTGTGTGCA TGTGTGTTTG GAAACTGGAA TATTTCATCT TCATTATTTT 4272 
CAAATGCAGG CCAGCTTAAC CTGTTCACAC AAATGATTTT GTGACCACTT CATTGTATCT 4332 
GTATCTTGAG AAGTTTGAAA TATCTATAGT GTCTACAATG CAGTTAATCC CTAGATATCG 43 92 
GATAATACCC AGTTCACTAG TAAACTCATT TCTCTCTGGG GAAGTGCTGA ATAGTCTCCA 4452 
AATTCAAGAA ACTCACCATG TCTTATAAAC CTTTAAGATA AAATTCCAAC GAGGTGTGTG 4512 
CAGCCATCTT CCAAATGACT GCCTGGATGT TTCTTAGTCC AGTTACTACT GCTGCTGCTA 4572 
TTGGTCTTTC TTTTATTGTT AATGTGTTGA TATGTTGTTA TTATTATGGT TATTATTATT 4632 
GATGTTGTTA CTATATTTAA AAATGATGAG ATGAAGTGGA AGTAGAGTTT GGGAGAAATG 46 92 
AAATCTCTCT TTTTTGTTCT CTTCTTGAAA TTCAGTTTCA AAAAATACAA TATTGGGTGG 4752 
CAAAAAAAAA AAAAAAAAA 4771 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Gly Leu Gin Ala Leu Ser Pro Arg Met Leu Leu Trp Leu Val Val 
1 5 io 



15 



Ser Gly He Val Phe Ser Arg Val Leu Trp Val Cys Ala Gly Leu Asp 
20 25 30 

Tyr Asp Tyr Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr Glu Pro lie 
35 40 45 

Asp Tyr Lys Asp Pro Cys Lys Ala Ala Val Phe Trp Gly Asp He Ala 
5 ° 55 60 

Leu Asp Asp Glu Asp Leu Asn He Phe Gin He Asp Arg Thr He Asp 
65 ™ 75 80 

Leu Thr Gin Ser Pro Phe Gly Lys Leu Gly His He Thr Gly Gly Phe 
85 90 95 

Gly Asp His Gly Met Pro Lys Lys Arg Gly Ala Leu Tyr Gin Leu He 
100 105 no 

Glu Arg He Arg Arg He Gly Ser Gly Leu Glu Gin Asn Asn Thr Met 
115 120 
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Lys Gly Lys Ala Pro Pro Lys Leu Ser Glu Gin Ser Glu Lys Asn Atq 
13 0 135 140 

Val Pro Arg Ala Ala Thr Ser Arg Thr Glu Arg He Trp Pro Gly Gly 
145 150 155 i 6 o 

Val He Pro Tyr Val He Gly Gly Asn Phe Thr Gly Ser Gin Arg Ala 
165 170 1 75 

Met Phe Lys Gin Ala Met Arg His Trp Glu Lys His Thr Cys Val Thr 
180 185 190 

Phe Thr Glu Arg Ser Asp Glu Glu Ser Tyr He Val Phe Thr Tyr Arcr 
195 2 00 205 

Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn Gly Pro Gin 
210 215 220 

Ala He Ser He Gly Lys Asn Cys Asp Lys Phe Gly He Val Val His 
225 230 235 240 

Glu Leu Gly His Val He Gly Phe Trp His Glu His Thr Arg Pro Asp 
245 250 255 

Arg Asp Asn His Val Thr He He Arg Glu Asn He Gin Pro Gly Gin 
260 265 270 

Glu Tyr Asn Phe Leu Lys Met Glu Pro Gly Glu Val Asn Ser Leu Gly 
275 280 285 

Glu Arg Tyr Asp Phe Asp Ser He Met His Tyr Ala Arg Asn Thr Phe 

295 300 
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Ser Arg Gly Met Phe Leu Asp Thr lie Leu Pro Ser Arg Asp Asp Asn 
305 310 315 320 

Gly lie Arg Pro Ala He Gly Gin Arg Thr Arg Leu Ser Lys Gly Asp 
325 330 335 

He Ala Gin Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys Gly Glu Thr 
340 345 350 

Leu Gin Glu Ser Ser Gly Asn Leu Ser Ser Pro Gly Phe Pro Asn Gly 
355 360 365 

Tyr Pro Ser Tyr Thr His Cys He Trp Arg Val Ser Val Thr Pro Gly 
370 375 380 

Glu Lys He Val Leu Asn Phe Thr Thr Met Asp Leu Tyr Lys Ser Ser 
385 390 395 400 

Leu Cys Trp Tyr Asp Tyr He Glu Val Arg Asp Gly Tyr Trp Arg Lys 
405 410 415 

Ser Pro Leu Leu Gly Arg Phe Cys Gly Asp Lys Val Ala Gly Val Leu 
420 425 430 

Thr Ser Thr Asp Ser Arg Met Trp He Glu Phe Arg Ser Ser Ser Asn 
435 440 445 

Trp Val Gly Lys Qly Phe Ala Ala Val Tyr Glu Ala He Cys Gly Gly 
450 455 460 

Glu He Arg Lys Asn Glu Gly Gin He Gin Ser Pro Asn Tyr Pro Asp 
465 470 475 AS0 

Asp Tyr Arg Pro Met Lys Glu Cys Val Trp Lys He Met Val Ser Glu 
485 490 495 

Gly Tyr His Val Gly Leu Thr Phe Gin Ala Phe Glu He Glu Arg His 
500 505 510 

Asp Ser Cys Ala Tyr Asp His Leu Glu Val Arg Asp Gly Ala Ser Glu 
515 520 525 

Asn Ser Pro Leu He Gly Arg Phe Cys Gly Tyr Asp Lys Pro Glu Asp 
530 535 540 

He Arg Ser Thr Ser Asn Thr Leu Trp Met Lys Phe Val Ser Asp Gly 
545 550 555 560 

Thr Val Asn Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys Glu Glu Asp 
565 570 575 

Glu Cys Ala Lys Pro Asp Arg Gly Gly Cys Glu Gin Arg Cys Leu Asn 
580 585 590 

Thr Leu Gly Ser Tyr Gin Cys Ala Cys Glu Pro Gly Tyr Glu Leu Gly 
595 600 605 

Pro Asp Arg Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu Leu Thr Lys 
610 615 620 

Leu Asn Gly Thr He Thr Thr Pro Gly Trp Pro Lys Glu Tyr Pro Pro 
625 630 635 640 

Asn Lys Asn Cys Val Trp Gin Val He Ala Pro Ser Gin Tyr Arg He 
645 650 655 



Ser Val Lys Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu Val Cys Lys 
660 665 670 

Tyr Asp Tyr Val Glu lie Trp Ser Gly Pro Ser Ser Glu Ser Lys Leu 
675 680 685 

5 His Gly Lys Phe Cys Gly Ala Asp lie Pro Glu Val Met Thr Ser His 

690 695 700 

Phe Asn Asn Met Arg lie Glu Phe Lys Ser Asp Asn Thr Val Ser Lys 
705 710 715 720 

Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser Lys 
10 725 730 735 

Asp Asn Gly Gly Cys Gin His Glu Cys Val Asn Thr Met Gly Ser Tyr 
740 745 750 

Thr Cys Gin Cys Arg Asn Gly Phe Val Leu His Glu Asn Lys His Asp 
755 760 765 

=3-5 Cys Lys Glu Ala Glu Cys Glu Gin Lys lie His Ser Pro Ser Gly Leu 

770 775 780 

lie Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg Lys Glu Cys 
4* 785 790 795 800 

i-.l Thr Trp Val lie £er Ala lie Pro Gly His Arg He Thr Leu Ala Phe 

JSfO 805 810 815 

01 Asn Glu Phe Glu Val Glu Gin His Gin Glu Cys Ala Tvr Asp His Leu 

5 820 825 830 

Jj Glu He Phe Asp Gly Glu Thr Glu Lys Ser Pro He Leu Gly Arg Leu 

^ 835 840 845 

ru 

-JSS Cys Gly Ser Lys He Pro Asp Pro Leu Met Ala Thr Gly Asn Glu Met 

5 850 855 860 

Phe He Arg Phe He Ser Asp Ala Ser Val Gin Arg Lys Gly Phe Gin 
865 870 875 880 

Ala Thr His Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala Glu Ser Lys 
30 885 890 895 

Pro Arg Asp Leu Tyr Ser His Ala Gin Phe Gly Asp Asn Asn Tyr Pro 
900 905 910 

Gly Gin Leu Asp Cys Glu Trp Leu Leu Val Ser Glu Arg Gly Ser Arg 
915 920 925 - - 

35 Leu Glu Leu Ser Phe Gin Thr Phe Glu Val Glu Glu Glu Ala Asp Cys 

930 935 940 

Gly Tyr Asp Tyr Val Glu Val Phe Asp Gly Leu Ser Ser Lys Ala Val 
945 950 955 960 

Gly Leu Gly Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu He Tyr Ser 
40 965 970 975 

He Gly Asp Val Ala Leu He His Phe His Thr Asp Asp Thr He Asn 
980 985 990 

Lys Lys Gly Phe Tyr He Arg Tyr Lys Ser He Arg Tyr Pro Glu Thr 
995 1000 1005 
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Met His Ala Lys Asn * 
1010 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 3919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

<ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 648.. 3689 

O 5 (D) OTHER INFORMATION: /product- "human mTll protein" 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ril CTCACACTTT TGCTCTCTTG CAGTCAGTTG CTTTGCTGGC TTCTGCAGGC TTTTAAGGTC 60 

|f| TCGCGGCGTA GAAATGCCTG GCCCCCACCC CCTTCCTCGG TCTCCCCTTT CAATTCAGAT 120 

^ GTGCTGATGT GCAGACCGGA TTCATCTTCT CGGAGCTGCG GCGGCGGCTT TGGGCTCAGG 18 0 

g|0 CGGCGGCGGC TCGCGCTCGG CCGCGGAGTC CTGGCAGCAG CGGGGACGCG GCGCGGGAGT 240 

m CCGAGCTCTG GTGGCAGCTG AGCCCGCGGG GCGCCGCTCG CCGAGCCGCG GCCGCGGGAA 300 

tfj * GTTCGGCAGC CAGAAGGACG ACCTGGCAGG CTGCGAGCGC CAGCGCCGCC AGAGCCGAGT 36 0 

TTGCCTGCGC CCTCCCCGCC TCCGAGTGCA GAGTTCCTTA CCTGCCCTCC GCCCACCCGT 42 0 

GGGCCCCTAG CCAACTTCTC CCTGCGACTG GGGGTAACAG GCAGTGCTTG CCCTCTCTAC 48 0 

25 TGTCCCGGCG GCATCCACAT GTTTCCGGAC ACCTGAGCAC CCCGGTCCCG CCGAGGAGCC 54 0 

TCCGGGTGGG GAGAAGAGCA CCGGTGCCCC TAGCCCCGCA CATCAGCGCG GACCGCGGCT 6 00 

GCCTAACCTC TGGGTCCCGT CCCCTCCTTT TCCTCCGGGG GAGGAGG ATG GGG TTG 656 

Met Gly Leu 
1015 
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GGA ACG CTT TCC CCG AGG ATG CTC GTG TGG CTG GTG GCC TCG GGG ATT 7 04 
Gly Thr Leu Ser Pro Arg Met Leu Val Trp Leu Val Ala Ser Gly He 
1020 1025 1030 



GTT TTC TAC GGG GAG CTA TGG GTC TGC GCT GGC ] CTC GAT TAT GAT TAC 752 
Val Phe Tyr Gly Glu Leu Trp Val Cys Ala Gly Leu Asp Tyr Asp Tyr 
35 1035 1040 1045 

ACT TTT GAT GGG AAC GAA GAG GAT AAA ACA GAG ACT ATA GAT TAC AAG 8 00 
Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr Glu Thr He Asp Tyr Lys 
1050 1055 10 60 1065 

GAC CCG TGT AAA GCC GCT GTA TTT TGG GGC GAT ATT GCC TTA GAT GAT 848 
4 0 Asp Pro Cys Lys Ala Ala Val Phe Trp Gly Asp He Ala Leu Asp Asp 

1070 1075 1080 
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GAA GAC TTA AAT ATC TTT CAA ATA GAT AGG ACA ATT GAC CTT ACG CAG 896 
Glu Asp Leu Asn lie Phe Gin lie Asp Arg Thr He Asp Leu Thr Gin 
1085 1090 1095 

AAC CCC TTT GGA AAC CTT GGA CAT ACC ACA GGT GGA CTT GGA GAC CAT 944 
Asn Pro Phe Gly Asn Leu Gly His Thr Thr Gly Gly Leu Gly Asp His 
1100 H05 mo 

GCT ATG TCA AAG AAG CGA GGG GCC CTC TAC CAA CTT ATA GAC AGG ATA 992 
Ala Met Ser Lys Lys Arg Gly Ala Leu Tyr Gin Leu He Asp Arcr He 
HIS 1120 ii 2 5 

AGA AGA ATT GGC TTT GGC TTG GAG CAA AAC AAC ACA GTT AAG GGA AAA 104 0 
Arg Arg He Gly Phe Gly Leu Glu Gin Asn Asn Thr Val Lys Gly Lys 
1130 H35 H40 1145 

GTA CCT CTA CAA TTC TCA GGG CAA AAT GAG AAA AAT CGA GTT CCC AGA 1088 
Val Pro Leu Gin Phe Ser Gly Gin Asn Glu Lys Asn Arg Val Pro Arg X 
I 150 H55 H60 

GCC GCT ACA TCA AGA ACG GAA AGA ATA TGG CCT GGA GGC GTT ATT CCT 1136 
Ala Ala Thr Ser Arg Thr Glu Arg He Trp Pro Gly Gly Val He Pro 
H65 1170 1175 

TAT GTT ATA GGA GGA AAC TTC ACT GGC AGC CAG AGA GCC ATG TTC AAG 1184 
Tyr Val He Gly Gly Asn Phe Thr Gly Ser Gin Arg Ala Met Phe Lys 
1180 H85 H90 

CAG GCC ATG AGG CAC TGG GAA AAG CAC ACA TGT GTG ACT TTC ATA GAA 1232 
Gin Ala Met Arg His Trp Glu Lys ' His Thr Cys Val Thr Phe He Glu 
US 5 1200 1205 

AGA AGT GAT GAA GAG AGT TAC ATT GTA TTC ACC TAT AGG CCT TGT GGA 128 0 
Arg Ser Asp Glu Glu Ser Tyr He Val Phe Thr Tyr Arg Pro Cys Gly 
1210 1215 1220 1225 

TGC TGC TCC TAT GTA GGT CGG CGA GGA AAT GGA CCT CAG GCA ATC TCT 132 8 
Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn Gly Pro Gin Ala He Ser 
I 230 1235 1240 

ATC GGC AAG AAC TGT GAT AAA TTT GGG ATT GTT GTT CAT GAA TTG GGT 1376 
He Gly Lys Asn Cys Asp Lys Phe Gly He Val Val His Glu Leu Gly 
1245 1250 1255 

CAT GTG ATA GGC TTT TGG CAT GAA CAC ACA AGA CCA GAT CGA GAT AAC 1424 
His Val He Gly Phe Trp His Glu His Thr Arg Pro Asp Arg Asp Asn 
1260 1265 1270 

CAC GTA ACT ATC ATA AGA GAA AAC ATC CAG CCA GGT CAA GAG TAC AAT 14 72 
His Val Thr He He Arg Glu Asn He Gin Pro Gly Gin Glu Tyr Asn 
1275 1280 1285 

TTT CTG AAG ATG GAG CCT GGA GAA GTA AAC TCA CTT GGA GAA AGA TAT 1520 

foL LyS Met Glu Pr ° G1 ^ Glu Val Asn Ser Leu Gly Glu Arg Tyr 
1290 1295 1300 y 1305 

GAT TTC GAC AGT ATC ATG CAC TAT GCC AGG AAC ACC TTC TCA AGG GGG 1 SfiR 
Asp Phe Asp Ser He Met His Tyr Ala Arg Asn Thr Se Ser* JSg 25 
1310 1315 1320 

ATG TTT CTG GAT ACC ATT CTC CCC TCC CGT GAT GAT AAT GGC ATA CGT 1616 
Met Phe Leu Asp Thr He Leu Pro Ser Arg Asp Asp Asn Giy nj ArJ 
1325 i 33 o 1335 

Pro A?J ill Oil ^ n GA ^ C ^ CTA AGC ^ GGA GAT ATC OCA CAG 1664 
Pro Ala lie Gly Gin Arg Thr Arg Leu Ser Lys Gly Asp He Ala Gin 
1340 i 3 45 i3 5 o 
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GCA AGA AAG CTG TAT AGA TGT CCA GCA TGT GGA GAA ACT CTA CAA GAA 1712 
Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys Gly Glu Thr Leu Gin Glu 
1355 1360 1365 

TCC AAT GGC AAC CTT TCC TCT CCA GGA TTT CCC AAT GGC TAC CCT TCT 1760 
b Ser Asn Gly Asn Leu Ser Ser Pro Gly Phe Pro Asn Gly Tyr Pro Ser 

1370 1375 1380 1385 

TAC ACA CAC TGC ATC TGG AGA GTT TCT GTG ACC CCA GGG GAG AAG ATT 1808 
Tyr Thr Hxs Cys lie Trp Arg Val Ser Val Thr Pro Gly Glu Lys He 
1390 1395 1400 

10 GTT TTA AAT TTT ACA ACG ATG GAT CTA TAC AAG AGT AGT TTG TGC TGG 1856 

Val Leu Asn Phe Thr Thr Met Asp Leu Tyr Lys Ser Ser Leu Cys Trp 
1405 1410 1415 

TAT GAC TAT ATT GAA GTA AGA GAC GGG TAC TGG AGA AAA TCA CCT CTC 1904 
Tyr Asp Tyr He Glu Val Arg Asp Gly Tyr Trp Arg Lys Ser Pro Leu 
15 1420 1425 1430 

O CTT GGT AGA TTC TGT GGG GAC AAA TTG CCT GAA GTT CTT ACT TCT ACA 1952 

*Q Leu Gly Arg Phe Cys Gly Asp Lys Leu Pro Glu Val Leu Thr Ser Thr 
Cl 1435 1440 1445 

L GAC AGC AGA ATG TQ G ATT GAG TTT CGT AGC AGC AGT AAT TGG GTA GGA 2000 

s*4V Asp Ser Arg Met Trp He Glu Phe Arg Ser Ser Ser Asn Trp Val Gly 

yy 1450 1455 1460 1465 

^ AAA GGC TTT GCA GCT GTC TAT GAA GCG ATC TGT GGA GGT GAG ATA CGT 2048 

L Y S G1 Y phe A la Ala Val Tyr Glu Ala He Cys Gly Gly Glu He Arg 
J_ 147 0 1475 1480 

g§5 AAA AAT GAA GGA CAG ATT CAG TCT CCC AAT TAT CCT GAT GAC TAT CGC 2096 

ni Lys Asn Glu Gly Gin He Gin Ser Pro Asn Tyr Pro Asp Asp Tyr Ara 

IS - 1485 1490 1495 

Q CCG ATG AAA GAA TGT GTG TGG AAA ATA ACA GTG TCT GAG AGC TAC CAC 2144 

|, Pro Met Lys Glu Cys Val Trp Lys He Thr Val Ser Glu Ser Tyr His 

30 1500 1505 1510 

GTC GGG CTG ACC TTT CAG TCC TTT GAG ATT GAA AGA CAT GAC AAT TGT 2192 
Val Gly Leu Thr Phe Gin Ser Phe Glu He Glu Arg His Asp Asn Cys 
1515 1520 1525 

GCT TAT GAC TAC CTG GAA GTT AGA GAT GGA ACC AGT GAA AAT AGC CCT 2240 
35 Ala Tyr Asp Tyr Leu Glu Val Arg Asp Gly Thr Ser Glu Asn Ser Pro 

1530 1535 1540 1545 

TTG ATA GGG CGT TTC TGT GGT TAT GAC AAA CCT GAA GAC ATA AGA TCT 2288 
Leu He Gly Arg Phe Cys Gly Tyr Asp Lys Pro Glu Asp He Arg Ser 
1550 1555 1560 

40 ACC TCC AAT ACT TTG TGG ATG AAG TTT GTT TCT GAC GGA ACT GTG AAC 2336 

Thr Ser Asn Thr Leu Trp Met Lys Phe Val Ser Asp Gly Thr Val Asn 
1565 1570 1575 

AAA GCA GGG TTT GCT GCT AAC TTT TTT AAA GAG GAA GAT GAG TGT GCC 2384 
Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys Glu Glu Asp Glu Cys Ala 
^ D 1580 1585 1590 

AAA CCT GAC CGT GGA GGC TGT GAG CAG CGA TGT CTG AAC ACT CTG GGC 2432 
Lys Pro Asp Arg Gly Gly Cys Glu Gin Arg Cys Leu Asn Thr Leu Gly 
1595 1600 1605 

50 ter Sr Gin Cvl ti° V™ ^ ° CT GG ° TAT GAG CTG GGG CCA GA ? AGA 2480 

1610 YS 7 ^ G1U LeU Gly Pr ° As ^ Ar 9 

1615 1620 1625 



-35- 



i 



AGG AGC TGT GAA GCT GCT TGT GGT GGA CTT CTT ACC AAA CTT AAC GGC 2528 
Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu Leu Thr Lys Leu Asn Gly 
1630 1635 1640 

ACC ATA ACC ACC CCT GGC TGG CCC AAG GAG TAC CCT CCT AAT AAG AAC 2576 
5 Thr lie Thr Thr Pro Gly Trp Pro Lys Glu Tyr Pro Pro Asn Lys Asn 

1645 1650 1655 

TGT GTG TGG CAA GTG GTT GCA CCA ACC CAG TAC AGA ATT TCT GTG AAG 2624 
Cys Val Trp Gin Val Val Ala Pro Thr Gin Tyr Arg lie Ser Val Lys 
1660 1665 1670 

10 TTT GAG TTT TTT GAA TTG GAA GGC AAT GAA GTT TGC AAA TAT GAT TAT 2672 

Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu Val Cys Lys Tyr Asp Tyr 
1675 1680 1685 

GTG GAG ATC TGG AGT GGT CTT TCC TCT GAG TCT AAA CTG CAT GGC AAA 2720 
Val Glu lie Trp Ser Gly Leu Ser Ser Glu Ser Lys Leu His Gly Lys 
15 1690 1695 1700 1705 

TTC TGT GGC GCT GAA GTG CCT GAA GTG ATC ACA TCC CAG TTC AAC AAT 2768 
^ Phe Cys Gly Ala Glu Val Pro Glu Val He Thr Ser Gin Phe Asn Asn 

y 1710 1715 1720 

ATG AGA ATT GAA TTC AAA TCT GAC AAT ACT GTA TCC AAG AAG GGC TTC 2816 

J|0 Met Arg He Glu Phe Lys Ser Asp Asn Thr Val Ser Lys Lys Gly Phe 
21 1725 1730 1735 

P.! AAA GCA CAT TTT TTC TCA GAC AAA GAT GAA TGC TCT AAG GAT AAT GGT 28 64 

ill Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser Lys Asp Asn Gly 
m 1740 1745 1750 

2,5 GGA TGT CAG CAC GAA TGT GTC AAC ACG ATG GGG AGC TAC ATG TGT CAA 2 912 



Gly Cys Gin His Glu Cys Val Asn Thr Met Gly Ser Tyr Met Cys Gin 
1755 1760 1765 



'5 TGC CGT AAT GGA TTT GTG CTA CAT GAC AAT AAA CAT GAT TGC AAG GAA 2 960 

cy S Arg Asn Gly Phe Val Leu His Asp Asn Lys His Asp Cys Lys Glu 
SO 1770 1775 1780 1785 

GCT GAG TGT GAA CAG AAG ATC CAC AGT CCA AGT GGC CTC ATC ACC AGT 3 008 
Ala Glu Cys Glu Gin Lys lie His Ser Pro Ser Gly Leu He Thr Ser 
1790 1795 1800 

CCC AAC TGG CCA GAC AAG TAC CCA AGC AGG AAA GAA TGC ACT TGG GAA 3056 
35 Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg Lys Glu Cys Thr Trp Glu 

1805 1810 1815 

ATC AGC GCC ACT CCC GGC CAC CGA ATC AAA TTA GCC TTT AGT GAA TTT 3104 
He Ser Ala Thr Pro Gly His Arg He Lys Leu Ala Phe Ser Glu Phe 
1820 1825 1830 

4 0 GAG ATT GAG CAG CAT CAA GAA TGT GCT TAT GAC CAC TTA GAA GTA TTT 3152 

Glu He Glu Gin His Gin Glu Cys Ala Tyr Asp His Leu Glu Val Phe 
1835 1840 1845 

GAT GGA GAA ACA GAA AAG TCA CCG ATT CTT GGA CGA CTA TGT GGC AAC 3200 
Asp Gly Glu Thr Glu Lys Ser Pro He Leu Gly Arg Leu Cys Gly Asn 
45 1850 1855 1860 1865 

AAG ATA CCA GAT CCC CTT GTG GCT ACT GGA AAT AAA ATG TTT GTT CGG 3248 
Lys He Pro Asp Pro Leu Val Ala Thr Gly Asn Lys Met Phe Val Arg 
1870 1875 1880 

TTT GTT TCT GAT GCA TCT GTT CAA AGA AAA GGC TTT CAA GCC ACA CAT 3296 
50 Phe Val Ser Asp Ala Ser Val Gin Arg Lys Gly Phe Gin Ala Thr His 

1885 1890 1895 
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TCT ACA GAG TGT GGC GGA CGA TTG AAA GCA GAA TCA AAA CCA AGA GAT 3344 
Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala Glu Ser Lys Pro Arg Asp 
1900 1905 1910 

CTG TAC TCA CAT GCT CAG TTT GGT GAT AAC AAC TAC CCA GGA CAG GTT 3 3 92 
Leu Tyr Ser His Ala Gin Phe Gly Asp Asn Asn Tyr Pro Gly Gin Val 
1915 1920 1925 

GAC TGT GAA TGG CTA TTA GTA TCA GAA CGG GGC TCT CGA CTT GAA TTA 3440 
Asp Cys Glu Trp Leu Leu Val Ser Glu Arg Gly Ser Arg Leu Glu Leu 
"30 1935 1940 1945 

TCC TTC CAG ACA TTT GAA GTG GAG GAA GAA GCA GAC TGT GGC TAT GAC 3488 
Ser Phe Gin Thr Phe Glu Val Glu Glu Glu Ala Asp Cys Gly Tyr Asp 
"50 1955 i960 

TAT GTG GAG CTC TTT GAT GGT CTT GAT TCA ACA GCT GTG GGG CTT GGT 3536 
Tyr Val Glu Leu Phe Asp Gly Leu Asp Ser Thr Ala Val Gly Leu Gly 
1965 1970 1975 

CGA TTC TGT GGA TCC GGG CCA CCA GAA GAG ATT TAT TCA ATT GGA GAT 3584 
Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu lie Tyr Ser He Gly Asp 
1980 1985 1990 

TCA GTT TTA ATT CAT TTC CAC ACT GAT GAC ACA ATC AAC AAG AAG GGA 3632 
Ser Val Leu He His Phe His Thr Asp Asp Thr He Asn Lys Lys Gly 
1995 2000 2005 

TTT CAT ATA AGA TAC AAA AGC ATA AGA TAT CCA GAT ACC ACA CAT ACC 3 680 
Phe Hxs He Arg Tyr Lys Ser He Arg Tyr Pro Asp Thr Thr His Thr 
2010 2015 2020 2025 

AAA AAA TAA CACCAAAACC TCTGTCAGAA CACAAAGGAA TGTGCATAAT ** 3 729 

Lys Lys * 

GGAGAGAAGA CATATTTTTT TTAAAACTGA AGATATTGGC ACAAATGTTT TATACAAAGA 3789 
GTTTGAACAA AAAATCCCTG TAAGACCAGA ATTATCTTTG TACTAAAAGA GAAGTTTCCA 3849 
GCAAAACCCT CATCAGCATT ACAAGGATAT TTGAACTCCA TGCTTGATGG TATTAATAAA 3909 
GCTGGTGAAA 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gly Leu Gly Thr Leu Ser Pro Arg Met Leu Val Trp Leu Val Ala 
1 5 10 15 

Ser Gly He Val Phe Tyr Gly Glu Leu Trp Val Cys Ala Gly 7 Leu Asp 
20 25 30 / 

Tyr Asp Tyr Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr Glu Thr He 
35 40 45 

Asp Tyr Lys Asp Pro Cys Lys Ala Ala Val Phe Trp Gly Asp He Ala 
50 55 60 



Leu Asp Asp Glu Asp Leu Asn lie Phe Gin He Asp Arg Thr He Asp 
65 70 75 80 

Leu Thr Gin Asn Pro Phe Gly Asn Leu Gly His Thr Thr Gly Gly Leu 
85 90 95 

Gly Asp His Ala Met Ser Lys Lys Arg Gly Ala Leu Tyr Gin Leu He 
100 105 no 

Asp Arg He Arg Arg He Gly Phe Gly Leu Glu Gin Asn Asn Thr Val 
115 120 125 

Lys Gly Lys Val Pro Leu Gin Phe Ser Gly Gin Asn Glu Lys Asn Arg 
130 135 140 

Val Pro Arg Ala Ala Thr Ser Arg Thr Glu Arg He Trp Pro Gly Gly 
145 150 155 160 

Val He Pro Tyr Val He Gly Gly Asn Phe Thr Gly Ser Gin Arg Ala 
165 170 175 

Met Phe Lys Gin Ala Met Arg His Trp Glu Lys His Thr Cys Val Thr 
180 185 190 

Phe He Glu Arg Ser Asp Glu Glu Ser Tyr He Val Phe Thr Tyr Arg 
195 200 205 

Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn Gly Pro Gin 
210 215 220 

Ala He Ser He Gly Lys Asn Cys Asp Lys Phe Gly He Val Val His 
225 230 235 .240 

Glu Leu Gly His Val He Gly Phe Trp His Glu His Thr Arg Pro Asp 
245 250 255 

Arg Asp Asn His Val Thr He He Arg Glu Asn He Gin Pro Gly Gin 
260 265 270 

Glu Tyr Asn Phe Leu Lys Met Glu Pro Gly Glu Val Asn Ser Leu Gly 
275 280 285 

Glu Arg Tyr Asp Phe Asp Ser He Met His Tyr Ala Arg Asn Thr Phe 
290 295 300 

Ser Arg Gly Met Phe Leu Asp Thr He Leu Pro Ser Arg Asp Asp Asn 
305 310 315 320 

Gly He Arg Pro Ala He Gly Gin Arg Thr Arg Leu Ser Lys Gly Asp 
325 330 335 

He Ala Gin Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys Gly Glu Thr 
340 345 350 

Leu Gin Glu Ser Asn Gly Asn Leu Ser Ser Pro Gly Phe Pro Asn Gly 
355 360 365 

Tyr Pro Ser Tyr Thr His Cys He Trp Arg Val Ser Val Thr Pro Gly 
370 375 380 

Glu Lys He Val Leu Asn Phe Thr Thr Met Asp Leu Tyr Lys Ser Ser 
385 390 395 400 

Leu Cys Trp Tyr Asp Tyr lie Glu Val Arg Asp Gly Tyr Trp Arg Lys 
405 410 415 
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Ser Pro Leu Leu Gly Arg Phe Cys Gly Asp Lys Leu Pro Glu Val Leu 
420 425 430 

Thr Ser Thr Asp Ser Arg Met Trp He Glu Phe Arg Ser Ser Ser Asn 
435 440 445 

Trp Val Gly Lys Gly Phe Ala Ala Val Tyr Glu Ala He Cys Gly Gly 
450 455 460 

Glu He Arg Lys Asn Glu Gly Gin He Gin Ser Pro Asn Tyr Pro Asp 
465 470 475 480 

Asp Tyr Arg Pro Met Lys Glu Cys Val Trp Lys He Thr Val Ser Glu 

485 490 495 

Ser Tyr His Val Gly Leu Thr Phe Gin Ser Phe Glu He Glu Arg His 
500 505 510 

Asp Asn Cys Ala Tyr Asp Tyr Leu Glu Val Arg Asp Gly Thr Ser Glu 
515 520 525 

Asn Ser Pro Leu He Gly Arg Phe Cys Gly Tyr Asp Lys Pro Glu Asp 
530 535 540 

He Arg Ser Thr Ser Asn Thr Leu Trp Met Lys Phe Val Ser Asp Gly 
545 550 555 560 

Thr Val Asn Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys Glu Glu Asp 
565 570 575 

Glu Cys Ala Lys Pro Asp Arg Gly Gly Cys Glu Gin Arg Cys Leu Asn 
580 585 590 

Thr Leu Gly Ser Tyr Gin Cys Ala Cys Glu Pro Gly Tyr Glu Leu Gly 
595 600 605 

Pro Asp Arg Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu Leu Thr Lys 
610 615 620 

Leu Asn Gly Thr He Thr Thr Pro Gly Trp Pro Lys Glu Tyr Pro Pro 
625 630 635 640 

Asn Lys Asn Cys Val Trp Gin Val Val Ala Pro Thr Gin Tyr Arg He 
645 650 655 

Ser Val Lys Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu Val Cys Lys 
660 665 670 

Tyr Asp Tyr Val Glu He Trp Ser Gly Leu Ser Ser Glu Ser Lys Leu 
675 680 685 

His Gly Lys Phe Cys Gly Ala Glu Val Pro Glu Val He Thr Ser Gin 
690 695 700 

Phe Asn Asn Met Arg He Glu Phe Lys Ser Asp Asn Thr Val Ser Lys 
705 710 715 720 

Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser Lys 
725 730 735 

Asp Asn Gly Gly Cys Gin His Glu Cys Val Asn Thr Met Gly Ser Tyr 
740 745 750 

Met Cys Gin Cys Arg Asn Gly Phe Val Leu His Asp Asn Lys His Asp 
755 760 765 



Cys Lys Glu Ala Glu Cys Glu Gin Lys He His Ser Pro Ser Gly Leu 
770 775 , 780 

He Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg Lys Glu Cys 
785 790 795 800 

5 Thr Trp Glu He Ser Ala Thr Pro Gly His Arg He Lys Leu Ala Phe 

805 810 815 

Ser Glu Phe Glu He Glu Gin His Gin Glu Cys Ala Tyr Asp His Leu 
820 825 830 

Glu Val Phe Asp Gly Glu Thr Glu Lys Ser Pro He Leu Gly Arg Leu 
10 835 840 845 

Cys Gly Asn Lys He Pro Asp Pro Leu Val Ala Thr Gly Asn Lys Met 
850 855 860 

Phe Val Arg Phe Val Ser Asp Ala Ser Val Gin Arg Lys Gly Phe Gin 
_ ; 865 870 875 880 

Ala Thr His Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala Glu Ser Lys 
885 890 895 



~jh Pro Arg Asp Leu Tyr Ser His Ala Gin Phe Gly Asp Asn Asn Tyr Pro 

iU 900 905 910 

S Gly Gin Val Asp Cys Glu Trp Leu Leu Val Ser Glu Arg Gly Ser Arq 

IP 915 920 925 

s Leu Glu Leu Ser Phe Gin Thr Phe Glu Val Glu Glu Glu Ala Asp Cys 

O 930 . 935 940 s - 

J! Gly Tyr Asp Tyr Val Glu Leu Phe Asp Gly Leu Asp Ser Thr Ala Val 

3^ 945 950 955 960 

Gly Leu Gly Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu He Tyr Ser 
|U 965 970 975 

He Gly Asp Ser Val Leu He His Phe His Thr Asp Asp Thr He Asn 
980 985 990 

Lys Lys Gly Phe His He Arg Tyr Lys Ser He Arg Tyr Pro Asp Thr 
30 9 ^5 1000 1005 

Thr His Thr Lys Lys * 
1010 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH r 2 0 base pairs 

<B) TYPE; nucleic acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE; other nucleic acid 

(A) DESCRIPTION: /desc - "oligonucleotide primer'* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CCAGCTTAAC CTGTTCACAC 



20 
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(2) INFORMATION FOR SEQ ID NO: 7: 



(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
10 AACTCTACTT CCACTTCATC 20 

(2) INFORMATION FOR SEQ ID NO: 8:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
,4 5 (C) STRANDEDNESS: single 
4? (D) TOPOLOGY: linear 

*D (ii) MOLECULE TYPE: other nucleic acid 

£ (A) DESCRIPTION: /desc = "oligonucleotide primer" 

W (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

^0 TCAGAACAGA AAGGAATGTG 20 

Q (2) INFORMATION FOR SEQ ID NO: 9: 

pi (i) SEQUENCE CHARACTERISTICS: 

"J! (A) LENGTH: 20 base pairs 

« (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
3** (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
3 0 GACCACTATT CCACATCACC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
4 0 TCTTGCAGTC AGTTGCTTTG CTGG 



20 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer 1 * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 11: 
10 TAGTGCGGCC GCACATTCCT TTGTGTTC 28 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

O (A) LENGTH: 18 amino acids 

™S (B) TYPE: amino acid 

.13-5 (C) STRANDEDNESS; 

~g (D) TOPOLOGY: linear 

ft] (ii) MOLECULE TYPE: peptide 

y (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

s Cys He Arg Tyr Lys Ser He Arg Tyr Pro Glu Thr Met His Ala 

S=of U 1 5 10 *l cr 

^ Lys Asn 



(2) INFORMATION FOR SEQ ID NO : 13 : 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 18 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys His He Arg Tyr Lys Ser He Arg Tyr Pro Asp Thr Thr His Thr 
1 5 10 15 

Lys Lys 



35 



GBMAD1VI34023 
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